Estimating the Error Distribution of a Single Tap Sequence without Ground Truth
Authors
Abstract
Detecting beats, estimating tempo, aligning scores to audio, and detecting onsets are all active problems in the field of music information retrieval. In much of this research, it is convenient to think of beats as occurring at precise time points. However, anyone who has attempted to label beats by hand soon realizes that precise annotation of music audio is not possible. A common method of beat annotation is simply to tap along with the audio and record the tap times. This raises the question: how accurate are the taps? It may seem that answering this question requires knowledge of "true" beat times. However, tap times can be modeled as random variables distributed around the true beat times. Multiple independent taps can then be used to estimate not only the location of the true beat time, but also the statistical distribution of measured tap times around it. Thus, without knowledge of true beat times, and without even requiring that precise beat times exist, we can estimate the uncertainty of tap times. This characterization of tapping can be useful for estimating tempo variation and for evaluating alternative annotation methods.
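The idea that independent taps reveal their own error distribution can be illustrated with a minimal sketch. It is not the paper's procedure, only one simple instance of the principle, under assumptions that go beyond the abstract: two tap sequences mark the same beats, their errors are independent, and both have the same variance. Under those assumptions the variance of the pairwise differences is twice the per-tap variance, so the per-tap standard deviation follows without ever seeing the true beat times. The names `estimate_tap_sigma` and `simulate_taps`, the 120 BPM grid, and the 30 ms error are all hypothetical choices for illustration.

```python
import math
import random
import statistics


def estimate_tap_sigma(taps_a, taps_b):
    """Estimate the per-tap timing standard deviation from two tap sequences.

    Assumption (not from the source): both sequences mark the same beats and
    carry independent errors of equal variance sigma^2, so that
    Var(a_i - b_i) = 2 * sigma^2.
    """
    if len(taps_a) != len(taps_b):
        raise ValueError("both sequences must mark the same beats")
    if len(taps_a) < 2:
        raise ValueError("need at least two beats")
    diffs = [a - b for a, b in zip(taps_a, taps_b)]
    # The sample variance subtracts the mean difference, so a constant
    # offset between the two tappers does not inflate the estimate.
    return math.sqrt(statistics.variance(diffs) / 2.0)


def simulate_taps(beats, sigma, rng):
    """Hypothetical tapper: each tap is a beat time plus Gaussian error."""
    return [t + rng.gauss(0.0, sigma) for t in beats]


rng = random.Random(0)
true_beats = [0.5 * i for i in range(2000)]  # hypothetical steady 120 BPM grid
taps_a = simulate_taps(true_beats, 0.03, rng)
taps_b = simulate_taps(true_beats, 0.03, rng)

# The estimator only sees the two tap sequences, never true_beats.
sigma_hat = estimate_tap_sigma(taps_a, taps_b)
print(f"estimated per-tap sigma: {sigma_hat:.4f} s")
```

The true beat times appear only in the simulation; the estimator works from the taps alone. If the two tappers differ in skill, this sketch would return something closer to a pooled value than to either individual's error.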
Publication date: 2009